618 results found.
Written
Web Service,
Language Type:
Trilingual
Languages:
Dutch English German
Availability:
Freely Available
License:
<Not Specified>
Size:
2095000 entries Production Status:
Newly created-in progress
Use:
Variational Linguistics/Computational Sociolinguistics
-
Paper title:Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics
-
Paper track:Written
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Dirk Hovy | Center for Language Technology, University of Copenhagen | DK | Center for Language Technology, University of Copenhagen | IT |
| Author 2 | Anders Johannsen | University of Copenhagen | DK | ||
| Main Contact | Dirk Hovy | Center for Language Technology, University of Copenhagen | None | Bocconi University | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English German Russian Spanish
Availability:
Evaluation
License:
Unspecified
Size:
56 MByte Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Improving Evaluation of English-Czech MT through Paraphrasing
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Petra Barancikova | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Author 2 | Rudolf Rosa | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Author 3 | Ales Tamchyna | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Main Contact | Petra Barancikova | Charles University in Prague, Faculty of Mathematics and Physics | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English German french italian
Availability:
Freely Available
License:
Open source for annotations; license for source text as stated in the paper
Size:
20 000 000 words Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:SwissAdmin: A multilingual tagged parallel corpus of press releases
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Yves Scherrer | LATL-CUI, Université de Genève | FI | ||
| Author 2 | Luka Nerima | LATL-CUI, Université de Genève | CH | LATL-University of Geneva | None |
| Author 3 | Lorenza Russo | LATL-CUI, Université de Genève | CH | ||
| Author 4 | Maria Ivanova | LATL-CUI, Université de Genève | CH | ||
| Author 5 | Eric Wehrli | LATL-CUI, Université de Genève | CH | ||
| Main Contact | Yves Scherrer | University of Helsinki | None |
Documentation:
The paper itself documents the corpus.Language Type:
Multilingual
Languages:
English German Spanish french italian
Availability:
Freely Available
License:
<Not Specified>
Size:
200000 entries Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:A Corpus for Multilingual Document Classification in Eight Languages
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Holger Schwenk | Facebook AI Research | FR |
| Author 2 | Xian Li | US | |
| Main Contact | Holger Schwenk | Facebook AI Research | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
English German Mandarin Chinese Slovenian Spanish
Availability:
Freely Available
License:
<Not Specified>
Size:
25 GB Production Status:
Newly created-finished
Use:
Semantic Web
-
Paper title:xLiD-Lexica: Cross-lingual Linked Data Lexica
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Lei Zhang | Karlsruhe Institute of Technology | DE |
| Author 2 | Michael Färber | Karlsruhe Institute of Technology | DE |
| Author 3 | Achim Rettinger | Karlsruhe Institute of Technology | DE |
| Main Contact | Lei Zhang | Karlsruhe Institute of Technology | None |
Documentation:
<Not Specified>
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Dutch German Russian french italian
Availability:
Freely Available
License:
<Not Specified>
Size:
471 KByte Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:SemR-11: A Multi-Lingual Gold-Standard for Semantic Similarity and Relatedness for Eleven Languages
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||||
|---|---|---|---|---|---|---|---|
| Author 1 | Siamak Barzegar | National University of Ireland, Galway | IE | ||||
| Author 2 | Brian Davis | Maynooth University | IE | ||||
| Author 3 | Manel Zarrouk | INSIGHT | IE | ||||
| Author 4 | Siegfried Handschuh | University of Passau | DE | Universität Passau | DE | Passau University | DE |
| Author 5 | André Freitas | University of Manchester | GB | ||||
| Main Contact | Siamak Barzegar | National University of Ireland, Galway | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
American English German Korean Mandarin Chinese Standard Arabic
Availability:
tbd
License:
TBD
Size:
9000 sentences Production Status:
Newly created-in progress
Use:
Natural Language Generation
-
Paper title:A Database for Measuring Linguistic Information Content
-
Paper track:Infrastructural Issues/Large Projects
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Richard Sproat | US | |||
| Author 2 | Bruno Cartoni | US | |||
| Author 3 | HyunJeong Choe | KR | |||
| Author 4 | David Huynh | US | |||
| Author 5 | Linne Ha | US | |||
| Author 6 | Ravindran Rajakumar | US | |||
| Author 7 | Evelyn Wenzel-Grondie | US | |||
| Main Contact | Richard Sproat | None | None | None |
Documentation:
TBD
Written
Lexicon,
Language Type:
Multilingual
Languages:
German Gothic Icelandic Old English Old High German
Availability:
Freely Available
License:
CC-BY
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Digital Humanities
-
Paper title:Linking Etymological Databases. A case study in Germanic
-
Paper track:long paper
-
Paper status:Accept
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Christian Chiarcos | Goethe-Universität Frankfurt am Main | DE | Universitaet Frankfurt am Main | DE |
| Author 2 | Maria Sukhareva | Goethe University Frankfurt | DE | ||
| Main Contact | Christian Chiarcos | Goethe-Universität Frankfurt am Main | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English German Portuguese Russian Turkish
Availability:
Not Available
License:
-
Size:
38000 words Production Status:
Newly created-in progress
Use:
Discourse
-
Paper title:Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Deniz Zeyrek | Middle East Technical University | TR |
| Author 2 | Amália Mendes | Centre for Linguistics of the University of Lisbon | PT |
| Author 3 | Murathan Kurfalı | Middle East Technical University | TR |
| Main Contact | Deniz Zeyrek | Middle East Technical University | None |
Documentation:
An annotation manual in English exists. Currently only available for the annotators.
Multimodal/Multimedia
Repository,
Language Type:
Multilingual
Languages:
English German German Sign Language Japanese
Availability:
Freely Available
License:
<Not Specified>
Size:
2.62 TB OtherProduction Status:
Existing-updated
Use:
Phonetics, Speech Recognition, Machine Translation, etc.
-
Paper title:The BAS Speech Data Repository
-
Paper track:Infrastructural Issues/Large Projects
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Uwe Reichel | Hungarian Academy of Sciences | HU |
| Author 2 | Florian Schiel | Bavarian Archive for Speech Signals | DE |
| Author 3 | Thomas Kisler | University of Munich | DE |
| Author 4 | Christoph Draxler | Institute of Phonetics and Speech Processing, LMU Munich | DE |
| Author 5 | Nina Pörner | University of Munich | DE |
| Main Contact | Uwe Reichel | Hungarian Academy of Sciences | None |
Documentation:
Corpus documentations (mainly German, English) available on the corpus landing pages




